Isotropic Representation Can Improve Dense Retrieval

Abstract

The latest Dense Retrieval (DR) models typically encode queries and documents using BERT and subsequently apply a cosine similarity-based scoring to determine the relevance. BERT representations, however, are known to follow an anisotropic distribution of a narrow cone shape, and such an anisotropic distribution can be undesirable for relevance estimation. In this work, we first show that BERT representations in DR also follow an anisotropic distribution. We adopt the unsupervised post-processing methods of Normalizing Flow and whitening to cope with the problem, and develop a token-wise method in addition to the sequence-wise method. The proposed methods can effectively enhance the isotropy of representations, thereby improving the performance of DR models such as ColBERT and RepBERT. To examine the potential of isotropic representation for improving the robustness of DR models, we investigate out-of-distribution tasks where the test dataset differs from the training dataset. The results show that isotropic representation can achieve generally improved performance. (The code is available at https://github.com/SNU-DRL/IsotropicIR.git)
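As a rough illustration of the whitening idea mentioned in the abstract, the sketch below fits a standard whitening transform (zero mean, identity covariance) on a matrix of embeddings and compares the mean pairwise cosine similarity before and after. It is not taken from the authors' repository: the function names, the synthetic data, and the use of mean cosine similarity as an anisotropy indicator are all assumptions made for this example. Under the same assumptions, a sequence-wise variant would fit the transform on one vector per query or document, while a token-wise variant would fit it on the individual token vectors.

```python
import numpy as np


def fit_whitening(X, eps=1e-12):
    """Fit a whitening transform on rows of X (n_vectors x dim).

    Returns the mean and a matrix W such that (X - mu) @ W has
    (approximately) identity covariance. Hypothetical helper.
    """
    mu = X.mean(axis=0, keepdims=True)
    Xc = X - mu
    cov = Xc.T @ Xc / len(X)              # dim x dim covariance
    U, S, _ = np.linalg.svd(cov)          # cov = U diag(S) U^T
    W = U / np.sqrt(S + eps)              # scales column j by 1/sqrt(S_j)
    return mu, W


def apply_whitening(X, mu, W):
    """Apply a previously fitted whitening transform."""
    return (X - mu) @ W


def mean_cosine(X):
    """Mean cosine similarity over all distinct pairs of rows.

    Values near 1 suggest a narrow cone; values near 0 suggest
    directions that are spread out.
    """
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    sims = Xn @ Xn.T
    n = len(X)
    return (sims.sum() - n) / (n * (n - 1))  # drop the diagonal of ones


if __name__ == "__main__":
    # Synthetic stand-in for encoder outputs: a large shared offset
    # pushes all vectors into a narrow cone (assumed toy data).
    rng = np.random.default_rng(0)
    scales = np.array([3.0, 2.0, 1.0, 1.0, 0.5, 0.5])
    X = rng.normal(size=(500, 6)) * scales + 10.0

    mu, W = fit_whitening(X)
    Z = apply_whitening(X, mu, W)

    print("mean cosine before:", mean_cosine(X))
    print("mean cosine after: ", mean_cosine(Z))
```

In a retrieval setting the transform would presumably be fitted once on a sample of representations and then applied to both query and document vectors before cosine scoring; that workflow is an assumption here, not something the abstract specifies.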

Similar resources

Relational labels can improve relational retrieval

Retrieval that is based on common relational structure, such as an underlying principle or pattern, is useful but typically rare. Based on evidence that comparison-derived schema abstraction can improve relational retrieval, we asked whether the use of relational labels can also promote abstraction and improve relational retrieval. Using a cued-recall paradigm, we varied the presence of relatio...

Linguistic Knowledge can Improve Information Retrieval

This article was published in the Proceedings of the Applied Natural Language Processing Conference (ANLP-2000) in Seattle, Washington, May 1-3, 2000. A preliminary version was published as a technical report (TR-99-83) in the Sun Microsystems Laboratories Technical Report Series. The article represents a milestone in an ongoing project aimed at discovering technology to help people find specif...

Indexing with WordNet synsets can improve text retrieval

The classical, vector space model for text retrieval is shown to give better results (up to 29% better in our experiments) if WordNet synsets are chosen as the indexing space, instead of word forms. This result is obtained for a manually disambiguated test collection (of queries and documents) derived from the Semcor semantic concordance. The sensitivity of retrieval performance to (automatic) ...

Selective memory retrieval can impair and improve retrieval of other memories.

Research from the past decades has shown that retrieval of a specific memory (e.g., retrieving part of a previous vacation) typically attenuates retrieval of other memories (e.g., memories for other details of the event), causing retrieval-induced forgetting. More recently, however, it has been shown that retrieval can both attenuate and aid recall of other memories (K.-H. T. Bäuml & A. Samenie...

Image Retrieval Via Isotropic and Anisotropic Mappings

This paper presents an approach for content-based image retrieval via isotropic and anisotropic mappings. Isotropic mappings are defined as mappings invariant to the action of the planar Euclidean group on the image space – invariant to the translation, rotation and reflection of image data, and hence, invariant to orientation and position. Anisotropic mappings, on the other hand, are defined a...

Journal

Journal title: Lecture Notes in Computer Science

Year: 2023

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-031-33380-4_10